Pesquisa | Portal Regional da BVS

1.

Quantifying Articulatory Working Space in Individuals Surgically Treated for Oral Cancer With Electromagnetic Articulography.

Tienkamp, Thomas B; Rebernik, Teja; Halpern, Bence M; van Son, Rob J J H; Wieling, Martijn; Witjes, Max J H; de Visscher, Sebastiaan A H J; Abur, Defne.

J Speech Lang Hear Res ; 67(2): 384-399, 2024 Feb 12.

Artigo em Inglês | MEDLINE | ID: mdl-38289853

RESUMO

PURPOSE: The purpose of this study was to quantify sentence-level articulatory kinematics in individuals treated for oral squamous cell carcinoma (ITOC) compared to control speakers while also assessing the effect of treatment site (jaw vs. tongue). Furthermore, this study aimed to assess the relation between articulatory-kinematic measures and self-reported speech problems. METHOD: Articulatory-kinematic data from the tongue tip, tongue back, and jaw were collected using electromagnetic articulography in nine Dutch ITOC and eight control speakers. To quantify articulatory kinematics, the two-dimensional articulatory working space (AWS; in mm2), one-dimensional anteroposterior range of motion (AP-ROM; in mm), and superior-inferior range of motion (SI-ROM in mm) were calculated and examined. Self-reported speech problems were assessed with the Speech Handicap Index (SHI). RESULTS: Compared to a sex-matched control group, ITOC showed significantly smaller AWS, AP-ROM, and SI-ROM for both the tongue tip and tongue back sensor, but no significant differences were observed for the jaw sensor. This pattern was found for both individuals treated for tongue and jaw tumors. Moderate nonsignificant correlations were found between the SHI and the AWS of the tongue back and jaw sensors. CONCLUSIONS: Despite large individual variation, ITOC showed reduced one- and two-dimensional tongue, but not jaw, movements compared to control speakers and treatment for tongue and jaw tumors resulted in smaller tongue movements. A larger sample size is needed to establish a more generalizable connection between the AWS and the SHI. Further research should explore how these kinematic changes in ITOC are related to acoustic and perceptual measures of speech.

Assuntos

Carcinoma de Células Escamosas , Neoplasias Maxilomandibulares , Neoplasias Bucais , Humanos , Inteligibilidade da Fala , Medida da Produção da Fala/métodos , Neoplasias Bucais/cirurgia , Acústica da Fala , Fala , Língua/cirurgia , Fenômenos Biomecânicos , Fenômenos Eletromagnéticos , Arcada Osseodentária

2.

The Effect of Rating Method on Reliability of Judgments of Strain Across Populations.

Sauder, Cara L; Kapsner-Smith, Mara R; Simmons, Emily; Meyer, Tanya; Doyle, Philip C; Eadie, Tanya L.

Am J Speech Lang Pathol ; 33(1): 393-405, 2024 Jan 03.

Artigo em Inglês | MEDLINE | ID: mdl-38060689

RESUMO

PURPOSE: Variability in auditory-perceptual ratings of voice limits their utility, with the poorest reliability often noted for vocal strain. The purpose of this study was to determine whether an experimental method, called visual sort and rate (VSR), promoted stronger rater reliability than visual analog scale (VAS), for ratings of strain in two clinical populations: adductor laryngeal dystonia (ADLD) and vocal hyperfunction (VH). METHOD: Connected speech samples from speakers with ADLD and VH as well as age- and sex-matched controls were selected from a database. Fifteen inexperienced listeners rated strain for two speaker sets (25 ADLD speakers and five controls; 25 VH speakers and five controls) across four rating blocks: VAS-ADLD, VSR-ADLD, VAS-VH, and VSR-VH. For the VAS task, listeners rated each speaker for strain using a vertically oriented 100-mm VAS. For the VSR task, stimuli were distributed into sets of samples with a range of severities in each set. Listeners sorted and ranked samples for strain within each set, and final ratings were captured on a vertically oriented 100-mm VAS. Intrarater reliability (Pearson's r) and interrater variability (mean of the squared differences between a listener's ratings and group mean ratings) were compared across rating methods and populations using two repeated-measures analyses of variance. RESULTS: Intrarater reliability of strain was significantly stronger when listeners used VSR compared to VAS; listeners also showed significantly better intrarater reliability in ADLD than VH. Listeners demonstrated significantly less interrater variability (better reliability) when using VSR compared to VAS. No significant effect of population or interactions was found between listeners for measures of interrater variability. CONCLUSIONS: VSR increases intrarater reliability for ratings of vocal strain in speakers with VH and ADLD. VSR decreases variability of auditory-perceptual judgments of strain between inexperienced listeners in these clinical populations. Future research should determine whether benefits of VSR extend to voice clinicians and/or clinical settings.

Assuntos

Disfonia , Percepção da Fala , Voz , Humanos , Qualidade da Voz , Julgamento , Reprodutibilidade dos Testes , Medida da Produção da Fala/métodos

3.

Do Adult Naïve Listeners Perceive Differences in Speech Before and After Therapy for Cleft Palate Speech Disorders? A Reliability Study of Perceptual Speech Ratings.

Alighieri, Cassandra; Meerschaert, Silke; Van Lierde, Kristiane.

J Speech Lang Hear Res ; 67(1): 116-125, 2024 Jan 08.

Artigo em Inglês | MEDLINE | ID: mdl-37992413

RESUMO

PURPOSE: This study compared the interrater reliability of adult naïve listeners' perceptual assessments of different speech variables in children with a cleft palate with or without a cleft lip (CP ± L). In addition, the study investigated whether the listeners were able to perceive differences in these speech variables before and after speech therapy for cleft palate speech disorders. METHOD: Thirty-four speech samples of 14 children with a CP ± L (14 samples collected immediately before 10 hr of speech intervention, 14 samples collected immediately after speech intervention, and six randomly selected samples that were duplicated to assess intrarater reliability) were perceptually assessed by 26 adult naïve listeners. The listening panel consisted of nine men and 17 women (age range: 18-51 years). The speech variables included speech understandability, speech acceptability, hypernasality, hyponasality, nasal airflow, and articulation, which were assessed on a visual analog scale. Furthermore, the need for speech therapy was assessed. RESULTS: Good to very good interrater reliability was observed for the naïve listeners' ratings of all speech variables. A significant time effect was found for the pre- and postevolution of the speech variables "speech understandability," "speech acceptability," "nasal airflow," and "articulation." This time effect indicates an improvement of these variables postintervention. According to the naïve listeners, children were less in need of additional speech therapy after the 10-hr intervention period compared to assessments before this intervention period. CONCLUSIONS: Adult naïve listeners perceptually identified an improvement in different speech variables after 10 hr of cleft palate speech therapy. These findings confirm previous assessments of expert speech-language pathologists and suggest that speech improvements after cleft palate speech therapy can also be perceived by communication partners outside the therapy room. Perceptual ratings of naïve listeners can, thus, be used to add life-situation significance to the assessments of experts. Future research could include both expert raters and caregivers or relatives of children with a CP ± L in listening panels, as previous knowledge on craniofacial anomalies may lead to different results.

Assuntos

Fenda Labial , Fissura Palatina , Distúrbios da Voz , Masculino , Adulto , Criança , Humanos , Feminino , Adolescente , Adulto Jovem , Pessoa de Meia-Idade , Fissura Palatina/complicações , Fissura Palatina/terapia , Fala , Reprodutibilidade dos Testes , Medida da Produção da Fala/métodos , Distúrbios da Fala/etiologia , Distúrbios da Fala/terapia , Fenda Labial/complicações , Fenda Labial/terapia

4.

[Evaluation of the results of treatment of patients with functional dysphonia using a cepstral test]. / Otsenka rezul'tatov lecheniya patsientov s funktsional'noi disfoniei s pomoshch'yu kepstral'nogo testa.

Chernobelsky, S I; Petrova, I A.

Vestn Otorinolaringol ; 88(5): 23-26, 2023.

Artigo em Russo | MEDLINE | ID: mdl-37970766

RESUMO

In order to evaluate the effectiveness of the treatment in patients with functional dysphonia, the Cepstral Peak Prominence (CPP) test was used. Twenty dysphonic women aged from 18 to 47 years were under observation. The control group consisted of 20 healthy women of close age. Patients underwent 5-7 sessions electrostimulation of laryngeal muscles and phonopedic treatment, after which a complete restoration of the voice was noted. The Praat clinical program was used, installed on a Hewlett-Packard 630 laptop (Pentium B960, 2.2 GHz). A SHURE SM94 condenser microphone was used as well. In the control group, the results were as follows: M=7.49 (SD=1.26) dB. In the main group before treatment: M=5.00 (SD=1.07) dB, after treatment: M=7.95 (SD=1.34) dB. Differences in KT values in the main group before and after treatment (5.00 dB and 7.95 dB, respectively) were significant at p<0.0001. Differences in KT values in the main group before treatment (5.00 dB) and in the control group (7.49 dB) were significant at p<0.0001. Differences in KT values in the main group after treatment (7.95 dB) and in the control group (7.49 dB) were not significant at p>0.05. The study showed high sensitivity of the method. The CPP data after treatment were higher than those before treatment and did not differ from the control ones. It is concluded that CPP is a highly sensitive method for evaluating the degree of periodicity of an acoustic signal and can be used to evaluate the effectiveness of treatment in patients with functional dysphonia.

Assuntos

Disfonia , Voz , Humanos , Feminino , Disfonia/diagnóstico , Disfonia/terapia , Acústica da Fala , Medida da Produção da Fala/métodos , Acústica

5.

Linguistic features of stuttering during spontaneous speech.

Warner, Haley J; Shroff, Ravi; Zuanazzi, Arianna; Arenas, Richard M; Jackson, Eric S.

J Fluency Disord ; 78: 106016, 2023 Dec.

Artigo em Inglês | MEDLINE | ID: mdl-37852018

RESUMO

PURPOSE: Previous work shows that linguistic features (e.g., word length, word frequency) impact the predictability of stuttering events. Most of this work has been conducted using reading tasks. Our study examined how linguistic features impact the predictability of stuttering events during spontaneous speech. METHODS: The data were sourced from the FluencyBank database and consisted of interviews with 35 adult stutterers (27,009 words). Three logistic regression mixed models were fit as the primary analyses: one model with four features (i.e., initial phoneme, grammatical function, word length, and word position within a sentence), a second model with six features (i.e., the features from the previous model plus word frequency and neighborhood density), and a third model with nine features (i.e., the features from the previous model plus bigram frequency, word concreteness, and typical age of word acquisition). We compared our models using the Area Under the Curve statistic. RESULTS: The four-feature model revealed that initial phoneme, grammatical function, and word length were predictive of stuttering events. The six-feature model revealed that initial phoneme, word length, word frequency, and neighborhood density were predictive of stuttering events. The nine-feature model was not more predictive than the six-feature model. CONCLUSION: Linguistic features that were previously found to be predictive of stuttering during reading were predictive of stuttering during spontaneous speech. The results indicate the influence of linguistic processes on the predictability of stuttering events such that words associated with increased planning demands (e.g., longer words, low frequency words) were more likely to be stuttered.

Assuntos

Fala , Gagueira , Adulto , Humanos , Gagueira/diagnóstico , Medida da Produção da Fala/métodos , Linguística/métodos , Idioma

6.

Evaluating the Effect of Voice Quality Covariance on Auditory-Perceptual Evaluation Using a Novel Two-Dimensional Magnitude Estimation Task.

Anand, Supraja; Park, Yeonggwang; Shrivastav, Rahul; Eddins, David A.

J Speech Lang Hear Res ; 66(12): 4849-4859, 2023 Dec 11.

Artigo em Inglês | MEDLINE | ID: mdl-37902504

RESUMO

PURPOSE: Most people with dysphonia present with voices that vary along more than one voice quality (VQ) dimension. This study sought to examine the effect of covariance between breathy and rough VQ in natural voices. METHOD: A two-dimensional matrix of 16 /a/ vowels was selected such that two VQ dimensions (breathiness and roughness) were sampled on a 4-point severity scale (none, mild, moderate, and severe). Ten listeners evaluated 480 stimuli (16 stimuli × 10 repetitions × 3 blocks) on one-dimensional magnitude estimation (1DME) tasks and a novel two-dimensional magnitude estimation (2DME) task that allowed for simultaneous measurement of breathiness and roughness. RESULTS: Data indicated high intra- and interrater reliabilities for both breathiness and roughness in the 2DME and 1DME tasks. Correlation analyses revealed a strong correlation between 2DME and 1DME judgments for breathiness and roughness (r > .95). There was also a minimal correlation between breathy and rough VQ in the 2DME task (r < .10). CONCLUSIONS: Covarying roughness or breathiness had less impact on the perception of the other VQ in natural dysphonic voices in 2DME compared to 1DME. An understanding and quantification of the perceptual interactions among the dimensions will aid in the refinement of computational models and in the establishment of the validity of clinical scales for VQ perception.

Assuntos

Disfonia , Percepção da Fala , Humanos , Qualidade da Voz , Medida da Produção da Fala/métodos , Reprodutibilidade dos Testes , Disfonia/diagnóstico , Julgamento , Acústica da Fala

7.

Profile of fluency in spontaneous speech, reading, and retelling of texts by adults who stutter. / Perfil da fluência na fala espontânea, leitura e no reconto de textos de adultos que gaguejam.

Silva, Samuel Lopes da; Alves, Luciana Mendonça; Britto, Denise Brandão de Oliveira E.

Codas ; 35(5): e20220009, 2023.

Artigo em Português, Inglês | MEDLINE | ID: mdl-37792751

RESUMO

PURPOSE: to describe the profile of fluency concerning the typology of disfluencies, speed, and frequency of disruptions in spontaneous speech, reading, and retelling; to compare the fluency profile in adults who stutter in spontaneous speech, reading, and retelling of text. METHODS: The present work is a cross-sectional comparative study with a sample composed of 15 adults who stutter of both sexes, with higher education or equivalent to complete elementary school II. Samples were collected in the tasks of spontaneous speech, reading, and text retelling through video calls made individually with the participants. The first 200 syllables expressed in each task were transcribed and analyzed according to the Fluency Profile Assessment Protocol (FPAP). The study compared the frequency of common and stuttering disfluencies and the speed in the different tasks surveyed. The Kruskal & Wallis test was used together with Duncan's multiple comparisons test to compare the medians and verify possible differences between the tasks researched with a significance level of 5%. RESULTS: The reading task presented a lower number of common disfluencies and a percentage of speech discontinuity about spontaneous speech and retelling tasks. No statistically significant differences were found between stuttering disfluencies in the three tasks surveyed. CONCLUSION: This study showed that there are differences in the occurrence of common disfluencies - hesitations, interjections, and revisions - and in the percentage of speech discontinuity during an oral reading of adults who stutter concerning spontaneous speech and text retelling.

OBJETIVO: descrever o perfil da fluência em relação à tipologia das disfluências, velocidade e frequência de rupturas na fala espontânea, na leitura e no reconto; comparar o perfil da fluência em adultos que gaguejam na fala espontânea, na leitura e no reconto de texto. MÉTODO: O trabalho é um estudo transversal comparativo com amostra composta por 15 adultos que gaguejam de ambos os sexos, com formação superior ou equivalente ao ensino fundamental II completo. Foram coletadas amostras nas tarefas de fala espontânea, leitura e reconto de texto por meio de video chamadas realizadas individualmente. As 200 primeiras sílabas expressas de cada tarefa foram transcritas e analisadas segundo o Protocolo de Avaliação do Perfil da Fluência (PAPF). O estudo comparou a frequência das disfluências comuns e gagas e a velocidade nas tarefas pesquisadas. Adotou-se o teste de Kruskal & Wallis em conjunto com o de comparações múltiplas de Duncan para comparar as medianas e verificar possíveis diferenças entre as tarefas pesquisadas com nível de significância de 5%. RESULTADOS: A tarefa de leitura apresentou menor número de disfluências comuns e percentual de descontinuidade de fala em relação às tarefas de fala espontânea e reconto. Não foram encontradas diferenças estatisticamente significantes entre as disfluências gagas nas três tarefas pesquisadas. CONCLUSÃO: Este trabalho mostrou que existem diferenças na ocorrência das disfluências comuns - hesitações, interjeições e revisões - e no percentual de descontinuidade de fala durante a leitura oral de adultos que gaguejam em relação à fala espontânea e ao reconto de texto.

Assuntos

Fala , Gagueira , Masculino , Feminino , Adulto , Humanos , Leitura , Estudos Transversais , Medida da Produção da Fala/métodos

8.

Proposal of requirements for the development of a training simulator for the auditory-perceptual judgment of voice. / Proposição de requisitos para o desenvolvimento de um simulador de treinamento para julgamento perceptivo-auditivo da voz.

Paiva, Maxsuel Alves Avelino de; Machado, Liliane Dos Santos; Lopes, Leonardo Wanderley.

Codas ; 35(6): e20220209, 2023.

Artigo em Português, Inglês | MEDLINE | ID: mdl-37820100

RESUMO

PURPOSE: to identify a set of requirements for the development of an auditory-perceptual training simulator (APT) based on the experience of professors who provide APT. METHODS: This is a cross-sectional, descriptive study with a quantitative approach. Twenty-two professors answered an online questionnaire containing 31 items related to APT, involving items about the professional profile, conditions for APT in undergraduate and postgraduate courses in Speech Therapy, APT structure, and evaluation of the APT effect. RESULT: it was observed that there is a variation in APT procedures performed in Brazil. The main requirements indicated by the respondents for the APT involve the use of synthesized voices in the initial moments, followed by human voices later; the use of speech tasks with sustained vowels and connected speech; the insertion of complementary information such as gender, age, the profession of the speaker and the spectrography of the vocal signal; training with a minimum time of six hours; the evaluation of the training effect by comparing intra- and inter-judge agreement before and after training; the addition of the parameters of general degree of vocal deviation, roughness, breathiness, and strain; the use of validated continuous and numerical scales; and offering it from the second year of the undergraduate program. CONCLUSION: although there is variability in the response of experts, a minimum set of requirements indicated for performing APT with new judges was identified.

OBJETIVO: identificar um conjunto de requisitos para o desenvolvimento de um simulador de treinamento perceptivo-auditivo (TPA) a partir da experiência de docentes que realizam o TPA. MÉTODO: Trata-se de um estudo transversal, descritivo, com abordagem quantitativa. Vinte e dois docentes responderam um questionário online contendo 31 itens relacionados ao TPA, envolvendo itens sobre o perfil profissional, condições para o TPA nos cursos de graduação e pós-graduação em Fonoaudiologia, estrutura do TPA, avaliação do efeito do TPA. RESULTADO: observou-se que existe variação nos procedimentos de TPA realizados no Brasil. Os principais requisitos indicados pelos respondentes para o TPA envolvem o uso de vozes sintetizadas nos momentos iniciais, seguindo para vozes humanas posteriormente; a utilização de tarefas de fala com vogais sustentadas e fala encadeada; a inserção de informações complementares tais como o gênero, idade, profissão do falante e a espectrografia do sinal vocal; treinamento com tempo mínimo de seis horas; a avaliação do efeito do treinamento pela comparação da concordância intra e inter-juizes pré e pós treinamento; a adição dos parâmetros de grau geral de desvio vocal, rugosidade, soprosidade e tensão; a utilização de escalas contínuas e numéricas validadas; e ser realizado a partir do segundo ano de graduação. CONCLUSÃO: embora haja uma variabilidade da resposta dos especialistas, foi identificado um conjunto mínimo de requisitos indicados para a realização de TPA com novos juízes.

Assuntos

Disfonia , Percepção da Fala , Humanos , Acústica da Fala , Julgamento , Estudos Transversais , Qualidade da Voz , Medida da Produção da Fala/métodos , Reprodutibilidade dos Testes , Variações Dependentes do Observador

9.

A Comparison of Sound Production Treatment and Metrical Pacing Therapy for Apraxia of Speech: A Single-Case Experimental Design.

King, Charlotte R; Wambaugh, Julie L; Maas, Edwin.

Am J Speech Lang Pathol ; 32(5S): 2493-2511, 2023 10 17.

Artigo em Inglês | MEDLINE | ID: mdl-37656150

RESUMO

PURPOSE: The purpose of this investigation was to compare the effects of two specific treatment protocols for acquired apraxia of speech (AOS): Sound Production Treatment (SPT) and Metrical Pacing Therapy (MPT), and to examine changes in communicative participation. METHOD: Four speakers with chronic AOS and aphasia were each administered SPT and MPT in a replicated crossover design (ABACA/ACABA) with nonconcurrent multiple baselines across participants and behaviors. Treatment outcomes were compared with respect to whole word correctness (WWC) for treated and untreated multisyllabic word targets. Speech intelligibility was assessed using the Chapel Hill Multilingual Intelligibility Test, and communicative participation was measured using the Communicative Participation Item Bank at baseline, washout, and follow-up phases. RESULTS: Three of the four participants experienced statistically significant improvements in WWC with SPT, and three of the four participants with MPT. Based on a priori criteria, three participants demonstrated relatively greater benefit from SPT and one participant demonstrated relatively greater benefit from MPT. There were measurable improvements in intelligibility following SPT for three of the four participants. Only one participant in this investigation reported a significant change in communicative participation, and only following MPT. CONCLUSIONS: This study demonstrated that individuals in the chronic stages of AOS can benefit from both SPT and MPT, corroborating prior research on articulatory kinematic and rate and/or rhythm control treatment approaches. It contributes a comparison of two protocols for AOS with respect to whole word targets, intelligibility, and individual self-report of communicative participation changes. More participants showed a relative advantage of SPT over MPT. One individual reported communicative participation improvement after MPT. SUPPLEMENTAL MATERIAL: https://doi.org/10.23641/asha.23971929.

Assuntos

Afasia , Apraxias , Humanos , Fala , Projetos de Pesquisa , Fonoterapia/métodos , Apraxias/diagnóstico , Apraxias/terapia , Afasia/terapia , Inteligibilidade da Fala , Medida da Produção da Fala/métodos

10.

Tongue Shape Complexity in Children With and Without Speech Sound Disorders.

Dokovova, Marie; Sugden, Ellie; Cartney, Gemma; Schaeffler, Sonja; Cleland, Joanne.

J Speech Lang Hear Res ; 66(7): 2164-2183, 2023 07 12.

Artigo em Inglês | MEDLINE | ID: mdl-37267440

RESUMO

PURPOSE: This study investigates the hypothesis that younger speakers and speakers with more severe speech sound disorders are more likely to use simpler (undifferentiated) tongue gestures due to difficulties with, or immaturity of, lingual motor control. METHOD: The hypothesis is tested using cross-sectional secondary data analysis of synchronous audio and high-speed ultrasound recordings from children with idiopathic speech sound disorders (n = 30, aged 5;0-12;11 [years;months]) and typically developing children (n = 29, aged 5;8-12;10), producing /a/, /t/, /É¹/, /l/, /s/, and /Ê/ in an intervocalic /aCa/ environment. Tongue shape complexity is measured using NINFL (Number of INFLections) and modified curvature index (MCI) from splines fitted to ultrasound images at the point of maximal lingual gesture. Age, perceived accuracy, and consonant are used as predictors. RESULTS: The results suggest that as age increases, children with speech sound disorders have lower MCI compared to typically developing children. Increase in age also led to decrease of MCI for the typically developing group. In the group of children with speech sound disorders, perceptually incorrect /É¹/ productions have lower MCI than correct productions, relative to /a/. CONCLUSIONS: There is some evidence of systematic tongue shape complexity differences between typically developing children and children with speech sound disorders when accounting for increase in age. Among children with speech sound disorders, increase in age and perceptually incorrect consonant realizations are associated with decreasing tongue shape complexity.

Assuntos

Transtorno Fonológico , Humanos , Criança , Transtorno Fonológico/diagnóstico por imagem , Estudos Transversais , Língua/diagnóstico por imagem , Ultrassonografia/métodos , Gestos , Fala , Fonética , Medida da Produção da Fala/métodos

11.

An Experimental Analysis on Multicepstral Projection Representation Strategies for Dysphonia Detection.

Contreras, Rodrigo Colnago; Viana, Monique Simplicio; Fonseca, Everthon Silva; Dos Santos, Francisco Lledo; Zanin, Rodrigo Bruno; Guido, Rodrigo Capobianco.

Sensors (Basel) ; 23(11)2023 May 30.

Artigo em Inglês | MEDLINE | ID: mdl-37299922

RESUMO

Biometrics-based authentication has become the most well-established form of user recognition in systems that demand a certain level of security. For example, the most commonplace social activities stand out, such as access to the work environment or to one's own bank account. Among all biometrics, voice receives special attention due to factors such as ease of collection, the low cost of reading devices, and the high quantity of literature and software packages available for use. However, these biometrics may have the ability to represent the individual impaired by the phenomenon known as dysphonia, which consists of a change in the sound signal due to some disease that acts on the vocal apparatus. As a consequence, for example, a user with the flu may not be properly authenticated by the recognition system. Therefore, it is important that automatic voice dysphonia detection techniques be developed. In this work, we propose a new framework based on the representation of the voice signal by the multiple projection of cepstral coefficients to promote the detection of dysphonic alterations in the voice through machine learning techniques. Most of the best-known cepstral coefficient extraction techniques in the literature are mapped and analyzed separately and together with measures related to the fundamental frequency of the voice signal, and its representation capacity is evaluated on three classifiers. Finally, the experiments on a subset of the Saarbruecken Voice Database prove the effectiveness of the proposed material in detecting the presence of dysphonia in the voice.

Assuntos

Disfonia , Voz , Humanos , Disfonia/diagnóstico , Acústica da Fala , Qualidade da Voz , Medida da Produção da Fala/métodos

12.

Normative Values of Cepstral Peak Prominence Measures in Typical Speakers by Sex, Speech Stimuli, and Software Type Across the Life Span.

Buckley, Daniel P; Abur, Defne; Stepp, Cara E.

Am J Speech Lang Pathol ; 32(4): 1565-1577, 2023 07 10.

Artigo em Inglês | MEDLINE | ID: mdl-37257202

RESUMO

PURPOSE: The purpose of this study was to determine normative values for cepstral peak prominence measures across the life span as a function of sex using clinically relevant stimuli (/É/, /i/, and two sentences of The Rainbow Passage) and two commonly used software types: Praat (Version 6.0.50) and Analysis of Dysphonia in Speech and Voice (ADSV). METHOD: One hundred fifty speakers (75 males, 75 females; evenly distributed into three age groups) without voice disorders aged 18-91 years were recorded via headset microphone in a sound-treated booth. Cepstral measures were analyzed using common analysis methods in Praat and ADSV by sex, stimuli, and software type. Kruskal-Wallis tests and post hoc Mood's Median tests for significant factors were performed on cepstral measures to assess the effects of age group, sex, stimuli, and software type. RESULTS: The results revealed statistically significant effects of sex, stimuli, and software type on cepstral measures, but no statistical effect of age group on cepstral values. Females had lower average cepstral values compared to males. Across stimuli, the highest average cepstral measure was found for sustained /É/, followed by sustained /i/, and then of the two sentences of The Rainbow Passage. Average cepstral measures in Praat were higher than those from ADSV. CONCLUSIONS: The current work did not find a statistical effect of age group on cepstral values; thus, normative cepstral values were reported by sex, stimuli, and software type. Future work should examine the applicability of these normative values for discriminating speakers with and without voice disorders.

Assuntos

Disfonia , Fala , Masculino , Feminino , Humanos , Acústica da Fala , Longevidade , Qualidade da Voz , Software , Medida da Produção da Fala/métodos

13.

The Relationship Between Acoustic and Kinematic Vowel Space Areas With and Without Normalization for Speakers With and Without Dysarthria.

Kuo, Christina; Berry, Jeffrey.

Am J Speech Lang Pathol ; 32(4S): 1923-1937, 2023 08 17.

Artigo em Inglês | MEDLINE | ID: mdl-37105919

RESUMO

PURPOSE: Few studies have reported on the vowel space area (VSA) in both acoustic and kinematic domains. This study examined acoustic and kinematic VSAs for speakers with and without dysarthria and evaluated effects of normalization on acoustic and kinematic VSAs and the relationship between these measures. METHOD: Vowel data from 12 speakers with and without dysarthria, presenting with a range of speech abilities, were examined. The speakers included four speakers with Parkinson's disease (PD), four speakers with brain injury (BI), and four neurotypical (NT) speakers. Speech acoustic and kinematic data were acquired simultaneously using electromagnetic articulography during a passage reading task. Raw and normalized VSAs calculated from corner vowels /i/, /æ/, /É/, and /u/ were evaluated. Normalization was achieved through z score transformations to the acoustic and kinematic data. The effect of normalization on variability within and across groups was evaluated. Regression analysis was used across speakers to assess the association between acoustic and kinematic VSAs for both raw and normalized data. RESULTS: When evaluating the speakers as three different groups (i.e., PD, BI, and NT), normalization reduced the standard deviations within each group and changed the relative differences in average magnitude between groups. Regression analysis revealed a significant relationship between normalized, but not raw, acoustic and kinematic VSAs, after the exclusion of an outlier speaker. CONCLUSIONS: Normalization reduces the variability across speakers, within groups, and changes average magnitudes affecting speaker group comparisons. Normalization also influences the correlation between acoustic and kinematic measures. Further investigation of the impact of normalization techniques upon acoustic and kinematic measures is warranted. SUPPLEMENTAL MATERIAL: https://doi.org/10.23641/asha.22669747.

Assuntos

Doença de Parkinson , Inteligibilidade da Fala , Humanos , Medida da Produção da Fala/métodos , Acústica da Fala , Disartria/diagnóstico , Disartria/etiologia , Fenômenos Biomecânicos , Acústica , Doença de Parkinson/complicações , Fonética

14.

Reliability of a Linguistic Segmentation Procedure Specified by Systemic Functional Linguistics to Examine Extemporaneous Speech.

Gravelin, Anna C; Archer, Brent; Oddo, Mary; Whitfield, Jason A.

J Speech Lang Hear Res ; 66(4): 1280-1290, 2023 04 12.

Artigo em Inglês | MEDLINE | ID: mdl-37014996

RESUMO

PURPOSE: Extemporaneous speech tasks provide an ecologically valid sample to examine speech acoustics, but differing methodologies exist in the literature for segmentation. Therefore, the purpose of this study was to examine the utility and reliability of a segmentation approach for extemporaneous speech specified by systemic functional linguistics (SFL) and its potential research and clinical applications. METHOD: Ten speakers without communication disorders served as participants in this study, and they responded to self-selected extemporaneous speaking prompts. Two expert analysts and one clinician analyst utilized a segmentation procedure specified by SFL to segment the extemporaneous speech samples into clauses and clause complexes. Intra- and interrater reliability were calculated for each analyst and pair of analysts. Acoustic measures of duration, speech rate, and intercomplex pause durations were calculated for each clause complex. RESULTS: Analyses for both intra- and interrater reliability revealed high percent agreement that was significantly greater than chance for expert and clinician analysts and between each pair of analysts (p < .001). Acoustic analyses revealed expected variation in number and duration of spoken syllables of clause complexes between and within speakers. CONCLUSIONS: The segmentation approach for extemporaneous speech specified by SFL is a reliable method for trained analysts that is informed by lexico-grammar and allows for acoustic measurement of speech production. It is also a reliable method for clinician analysts for speakers without communication disorders, and future work will investigate its utility for speakers with motor speech disorders. SUPPLEMENTAL MATERIAL: https://doi.org/10.23641/asha.22357138.

Assuntos

Linguística , Fala , Humanos , Reprodutibilidade dos Testes , Medida da Produção da Fala/métodos , Acústica da Fala

15.

The reliability of simultaneous versus individual data collection during stuttering assessment.

Davidow, Jason H; Ye, Jun; Edge, Robin L.

Int J Lang Commun Disord ; 58(4): 1251-1267, 2023.

Artigo em Inglês | MEDLINE | ID: mdl-36861494

RESUMO

BACKGROUND: Speech-language pathologists often multitask in order to be efficient with their commonly large caseloads. In stuttering assessment, multitasking often involves collecting multiple measures simultaneously. AIMS: The present study sought to determine reliability when collecting multiple measures simultaneously versus individually. METHODS & PROCEDURES: Over two time periods, 50 graduate students viewed videos of four persons who stutter (PWS) and counted the number of stuttered syllables and total number of syllables uttered, and rated speech naturalness. Students were randomly assigned to one of two groups: the simultaneous group, in which all measures were gathered during one viewing; and the individual group, in which one measure was gathered per viewing. Relative and absolute intra- and inter-rater reliability values were calculated for each measure. OUTCOMES & RESULTS: The following results were notable: better intra-rater relative reliability for the number of stuttered syllables for the individual group (intraclass correlation coefficient (ICC) = 0.839) compared with the simultaneous group (ICC = 0.350), smaller intra-rater standard error of measurement (SEM) (i.e., better absolute reliability) for the number of stuttered syllables for the individual group (7.40) versus the simultaneous group (15.67), and better inter-rater absolute reliability for the total number of syllables for the individual group (88.29) compared with the simultaneous group (125.05). Absolute reliability was unacceptable for all measures across both groups. CONCLUSIONS & IMPLICATIONS: These findings show that judges are likely to be more reliable when identifying stuttered syllables in isolation than when simultaneously collecting them with total syllables spoken and naturalness data. Results are discussed in terms of narrowing the reliability gap between data collection methods for stuttered syllables, improving overall reliability of stuttering measurements, and a procedural change when implementing widely used stuttering assessment protocols. WHAT THIS PAPER ADDS: What is already known on the subject The reliability of stuttering judgments has been found to be unacceptable across a number of studies, including those examining the reliability of the most popular stuttering assessment tool, the Stuttering Severity Instrument (4th edition). The SSI-4, and other assessment applications, involve collecting multiple measures simultaneously. It has been suggested, but not examined, that collecting measures simultaneously, which occurs in the most popular stuttering assessment protocols, may result in substantially inferior reliability when compared to collecting measures individually. What this paper adds to existing knowledge The present study has multiple novel findings. First, relative and absolute intra-rater reliability were substantially better when stuttered syllables data were collected individually compared to when the same data were collected simultaneously with total number of syllables and speech naturalness data. Second, inter-rater absolute reliability for total number of syllables was also substantially better when collected individually. Third, intra-rater and inter-rater reliability were similar when speech naturalness ratings were given individually compared to when they were given while simultaneously counting stuttered and fluent syllables. What are the potential or actual clinical implications of this work? Clinicians can be more reliable when identifying stuttered syllables individually compared to when they judge stuttering along with other clinical measures of stuttering. In addition, when clinicians and researchers use current popular protocols for assessing stuttering that recommend simultaneous data collection, including the SSI-4, they should instead consider collecting stuttering event counts individually. This procedural change will lead to more reliable data and stronger clinical decision making.

Assuntos

Gagueira , Humanos , Reprodutibilidade dos Testes , Índice de Gravidade de Doença , Fala , Medida da Produção da Fala/métodos , Gagueira/diagnóstico

16.

Developmental Cut-Points for Atypical Speech Intelligibility in Children With Cerebral Palsy.

Hustad, Katherine C; Mahr, Tristan J; Soriano, Jennifer U; Rathouz, Paul J.

J Speech Lang Hear Res ; 66(8S): 3089-3099, 2023 08 17.

Artigo em Inglês | MEDLINE | ID: mdl-36892950

RESUMO

PURPOSE: Early identification of speech motor involvement (SMI) in children with cerebral palsy (CP) is difficult because of overlapping features with many aspects of typical speech development. Quantitative measures of speech intelligibility have the potential to differentiate between children with SMI and those with no SMI (NSMI). We examined thresholds for speech intelligibility development in children with CP relative to the low end of age-specific typical developmental expectations. We sought to determine whether there were intelligibility differences between children with CP and NSMI versus typically developing (TD) age-mates across the range of development and whether there were differences between children with CP who have NSMI and those with CP who have SMI across the range of development based on speech intelligibility. METHOD: We used two large existing data sets that included speech samples from children between the ages of 2.5 and 8 years. One data set included 511 longitudinal speech samples from children with CP; the other included 505 cross-sectional speech samples from TD children. We examined receiver operating characteristic curves and sensitivity/specificity results by age for differentiating among groups of children. RESULTS: TD children versus those with CP and NSMI showed differentiation in their speech intelligibility across all ages, but the strength of differentiation was only marginally above chance. Children with CP and NSMI showed clear differentiation in their speech intelligibility from those with CP and SMI beginning at the earliest age point. Children with CP who have intelligibility below 40% at the age of 3 years have a very high probability of having SMI. CONCLUSIONS: Early intelligibility screening should be performed in children diagnosed with CP. Those with intelligibility below 40% at 3 years of age should be referred immediately for speech assessment and treatment.

Assuntos

Paralisia Cerebral , Inteligibilidade da Fala , Humanos , Criança , Pré-Escolar , Paralisia Cerebral/complicações , Medida da Produção da Fala/métodos , Estudos Transversais , Sensibilidade e Especificidade

17.

Acoustic Measures of Dysphonia in Amyotrophic Lateral Sclerosis.

Maffei, Marc F; Green, Jordan R; Murton, Olivia; Yunusova, Yana; Rowe, Hannah P; Wehbe, Farah; Diana, Kathleen; Nicholson, Katharine; Berry, James D; Connaghan, Kathryn P.

J Speech Lang Hear Res ; 66(3): 872-887, 2023 03 07.

Artigo em Inglês | MEDLINE | ID: mdl-36802910

RESUMO

PURPOSE: Identifying efficacious measures to characterize dysphonia in complex neurodegenerative diseases is key to optimal assessment and intervention. This study evaluates the validity and sensitivity of acoustic features of phonatory disruption in amyotrophic lateral sclerosis (ALS). METHOD: Forty-nine individuals with ALS (40-79 years old) were audio-recorded while producing a sustained vowel and continuous speech. Perturbation/noise-based (jitter, shimmer, and harmonics-to-noise ratio) and cepstral/spectral (cepstral peak prominence, low-high spectral ratio, and related features) acoustic measures were extracted. The criterion validity of each measure was assessed using correlations with perceptual voice ratings provided by three speech-language pathologists. Diagnostic accuracy of the acoustic features was evaluated using area-under-the-curve analysis. RESULTS: Perturbation/noise-based and cepstral/spectral features extracted from /a/ were significantly correlated with listener ratings of roughness, breathiness, strain, and overall dysphonia. Fewer and smaller correlations between cepstral/spectral measures and perceptual ratings were observed for the continuous speech task, although post hoc analyses revealed stronger correlations in speakers with less perceptually impaired speech. Area-under-the-curve analyses revealed that multiple acoustic features, particularly from the sustained vowel task, adequately differentiated between individuals with ALS with and without perceptually dysphonic voices. CONCLUSIONS: Our findings support using both perturbation/noise-based and cepstral/spectral measures of sustained /a/ to assess phonatory quality in ALS. Results from the continuous speech task suggest that multisubsystem involvement impacts cepstral/spectral analyses in complex motor speech disorders such as ALS. Further investigation of the validity and sensitivity of cepstral/spectral measures during continuous speech in ALS is warranted.

Assuntos

Esclerose Amiotrófica Lateral , Disfonia , Humanos , Adulto , Pessoa de Meia-Idade , Idoso , Disfonia/diagnóstico , Disfonia/etiologia , Esclerose Amiotrófica Lateral/complicações , Acústica da Fala , Qualidade da Voz , Acústica , Medida da Produção da Fala/métodos

18.

A Comprehensive Analysis of Speech Disfluencies in Autistic Young Adults and Control Young Adults: Group Differences in Typical, Stuttering-Like, and Atypical Disfluencies.

Pirinen, Veera; Loukusa, Soile; Dindar, Katja; Mäkinen, Leena; Hurtig, Tuula; Jussila, Katja; Mattila, Marja-Leena; Eggers, Kurt.

J Speech Lang Hear Res ; 66(3): 832-848, 2023 03 07.

Artigo em Inglês | MEDLINE | ID: mdl-36763844

RESUMO

PURPOSE: The purpose of this study was to examine the nature of speech disfluencies in autistic young adults and controls by using a wide-range disfluency classification of typical disfluencies (TD; i.e., filled pauses, revisions, abandoned utterances, and multisyllable word and phrase repetitions), stuttering-like disfluencies (SLD; i.e., sound and syllable repetitions, monosyllable word repetitions, prolongations, blocks, and broken words), and atypical disfluencies (AD; i.e., word-final prolongations and repetitions and atypical insertions). METHOD: Thirty-two autistic young adults and 35 controls completed a narrative telling task based on socially complex events. Frequencies of total disfluencies, TD, SLD, AD and stuttering severity were compared between groups. RESULTS: The overall frequency of disfluencies was significantly higher in the autistic group and significant between-group differences were found for all disfluency categories. The autistic group produced significantly more revisions, filled pauses, and abandoned utterances, and each subtype of SLD and AD than the control group. In total, approximately every fourth autistic participants scored at least a very mild severity of stuttering, and every fifth produced more than three SLD per 100 syllables. CONCLUSIONS: Disfluent speech can be challenging for effective communication. This study revealed that the speech of autistic young adults was highly more disfluent than that of the controls. The findings provide information on speech disfluency characteristics in autistic young adults and highlight the importance of evaluating speech disfluency with a wide-range disfluency classification in autistic persons in order to understand their role in overall communication. The results of this study offer tools for SLPs to evaluate and understand the nature of disfluencies in autistic persons.

Assuntos

Transtorno Autístico , Gagueira , Humanos , Adulto Jovem , Fala , Medida da Produção da Fala/métodos , Distúrbios da Fala

19.

Perceptual measurement of articulatory goodness in young children: Relationships with age, speech sound acquisition, and intelligibility.

Sakash, Ashley; Mahr, Tristan J; Hustad, Katherine C.

Clin Linguist Phon ; 37(12): 1141-1156, 2023 Dec 02.

Artigo em Inglês | MEDLINE | ID: mdl-36592037

RESUMO

Speech language pathologists regularly use perceptual methods in clinical practice to assess children's speech. In this study, we examined relationships between measures of speech intelligibility, clinical articulation test results, age, and perceptual ratings of articulatory goodness for children. We also examined the extent to which established measures of intelligibility and clinical articulation test results predicted articulatory goodness ratings, and whether goodness ratings were influenced by intelligibility. A sample of 164 (30-47 months) typically developing children provided speech samples and completed a standardised articulation test. Single word intelligibility scores and ratings of articulatory goodness were gathered from 328 naïve listeners; scores on a standardised articulation test were obtained from each child. Bivariate Pearson correlation, linear regression, and linear mixed effects modelling were used for analysis. Results showed that articulatory goodness ratings had the highest correlation with intelligibility, followed by age, followed by articulation score. Age and clinical articulation scores were both significant predictors of goodness ratings, but articulation scores made only a small contribution to prediction. Articulatory goodness ratings were substantially lower for unintelligible words compared to intelligible words, but articulatory goodness scores increased with age at the same rate for unintelligible and intelligible words. Perceptual ratings of articulatory goodness are sensitive to developmental changes in speech production (regardless of intelligibility) and yield a different kind of information than clinical articulation scores from standardised measures.

Assuntos

Fonética , Inteligibilidade da Fala , Criança , Pré-Escolar , Humanos , Cognição , Medida da Produção da Fala/métodos , Transtornos da Articulação

20.

Dissociating COVID-19 from other respiratory infections based on acoustic, motor coordination, and phonemic patterns.

Talkar, Tanya; Low, Daniel M; Simpkin, Andrew J; Ghosh, Satrajit; O'Keeffe, Derek T; Quatieri, Thomas F.

Sci Rep ; 13(1): 1567, 2023 01 28.

Artigo em Inglês | MEDLINE | ID: mdl-36709368

RESUMO

In the face of the global pandemic caused by the disease COVID-19, researchers have increasingly turned to simple measures to detect and monitor the presence of the disease in individuals at home. We sought to determine if measures of neuromotor coordination, derived from acoustic time series, as well as phoneme-based and standard acoustic features extracted from recordings of simple speech tasks could aid in detecting the presence of COVID-19. We further hypothesized that these features would aid in characterizing the effect of COVID-19 on speech production systems. A protocol, consisting of a variety of speech tasks, was administered to 12 individuals with COVID-19 and 15 individuals with other viral infections at University Hospital Galway. From these recordings, we extracted a set of acoustic time series representative of speech production subsystems, as well as their univariate statistics. The time series were further utilized to derive correlation-based features, a proxy for speech production motor coordination. We additionally extracted phoneme-based features. These features were used to create machine learning models to distinguish between the COVID-19 positive and other viral infection groups, with respiratory- and laryngeal-based features resulting in the highest performance. Coordination-based features derived from harmonic-to-noise ratio time series from read speech discriminated between the two groups with an area under the ROC curve (AUC) of 0.94. A longitudinal case study of two subjects, one from each group, revealed differences in laryngeal based acoustic features, consistent with observed physiological differences between the two groups. The results from this analysis highlight the promise of using nonintrusive sensing through simple speech recordings for early warning and tracking of COVID-19.

Assuntos

COVID-19 , Humanos , COVID-19/diagnóstico , Fala/fisiologia , Acústica , Ruído , Medida da Produção da Fala/métodos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

ENVIAR RESULTADO:

SELEÇÃO DE REFERÊNCIAS

DETALHE DA PESQUISA